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Abstract — We consider the asymptotic behavior of the po- 
larization process for polar codes when the bloclslength tends 
to infinity. In particular, we study the problem of asymptotic 
analysis of the cumulative distribution P(^n < z), where 
Zn — Z{Wn) is the Bhattacharyya process, and its depen- 
dence to the rate of transmission R. We show that for a 
BMS channel W, for R < I{W) we have lim„^oo P(^n < 

+°'^') = i? and for i? < 1 - I{W) we 

have lim„^ooP(^n > 1 - 2^^^^ ' ) = R, 

where Q{x) is the probability that the standard normal random 
variable will obtain a value larger than x. As a result, if we 
denote by ¥l^{n,R) the probability of error using polar codes 
of block-length A'^ — 2" and rate R < I{W) under succes- 
sive cancellation decoding, then log( log(P^'^(?i, i?))) scales as 

J + ^Jn — 1^ o(y'n). We also prove that the same result 

holds for the block error probability using the MAP decoder, i.e., 

for log(-log(Pr^''(n,i?))). 

I. Introduction 

Polar codes, recently introduced by Arikan |[T], are a fam- 
ily of codes that provably achieve the capacity of binary 
memoryless symmetric (BMS) channels using low-complexity 
encoding and decoding algorithms. The construction of polar 
codes involves a method called channel polarization. In this 
method, N = 2" copies of a BMS channel W are used 
to construct a set of 2" channels {W2^''}i<i<2" with the 
property that as n grows large, a fraction of almost I{W) 
of the channels have capacity close to 1 and a fraction of 
almost 1 — I{W) of the channels have capacity close to zero. 
The construction of these channels is done recursively, using 
a transform called channel splitting. Channel splitting is a 
transform which takes a BMS channel W as input and outputs 
two BMS channels and W~. We denote this transform 
by VF ^ {W+,W-). 

For N = 2", the construction of the channels can be 
visualized in the following way (|1|). Consider an infinite 
binary tree. To each vertex of the tree we assign a channel in 
a way that the collection of all the channels that correspond to 
the vertices at depth n equals {W2n}i<i<2'^- We do this by 
a recursive procedure. Assign to the root node the channel W 
itself. To the left offspring of the root node assign W~ and to 
the right one assign W^. In general, if Q is the channel that 
is assigned to vertex v, to the left offspring of v assign Q~ 
and to the right one assign Q+. 
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Remark 1: In this setting, the channel assigned to a vertex 
at level n, is obtained by starting from the original channel W 
and applying a sequence of + and — on it. More precisely, 
label the vertices at level n from left to right by 1 to 2". The 
channel which is assigned to the i-th vertex is Wj^'. Let the 
binary representation of i — 1 be 6162 • where hi is the 
most significant bit. By the mapping ^ — and 1 — > +, every 
binary sequence &i52 • • • 6„ is converted to a sequence of + 
and — , denoted by C1C2 • • • c„. Then we have 

E.g., assuming i = 7 we have W^^ = {{W^)^)^ . For a 
BMS channel W, denote the input alphabet by A" = {0, 1}, 
the output alphabet by 3^, and the transition probabilities by 
W{y\x). The Bhattacharyya parameter of W, denoted by 
Z{W), is given by 

Z(W) = VW{y\0)Wiy\l). 

The distribution of the Bhattacharyya parameter of the chan- 
nels {W^l^} i<i<2" plays a fundamental role in the analysis 
of polar codes. More precisely, for n G N and < z < 1, we 
are interested in analyzing the behavior of 

There is an entirely equivalent probabilistic description of ([T]i. 
Define the "polarization" process (||2l) of the channel W as 

Wo^W and 

w ^ j ; with probability i 

^^n+i <y ;with probability I . ^ ' 

In words, the process starts from the root node of the infinite 
binary tree and in each step moves either to the left or the 
right offspring of the current node with probability ^. So at 
time n, the process Wn outputs one of the 2" channels at level 
n of the tree uniformly at random. The Bhattacharyya process 
of the channel W is defined as Zn = Z{Wn)- In this setting, 
we have: 

P{Z„<z)=F{n,z). (3) 

Our objective is to investigate the behavior of P(Z„ < z). 
The analysis of the process Zn around the point z = 
is of particular interest since this indicates how the "good" 
channels, i.e., the channels that have mutual information close 
to 1, behave. According to ||2|, the process Z„ is a super- 
martingale which converges almost surely to a {0, l}-valued 
random variable Z^q with P(Zoo = 0) = I{W). We further 
have. 
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Theorem 2 (^): Let be a BMS channel. For any fixed 

liminf P(Z„ < 2-^"") = I{W). 
Conversely, if I{W) < 1, then for any fixed /3 > i. 



liminf P(Z„ > 2" 



= 1. 



As a result, the probability of error when using polar codes 
of length N = 2" under successive cancellation decoding 
behaves roughly as o{2~^) as N tends to infinity. Denote 
the error probability by Pf"{n,R). In this paper, we provide 
a refined estimate of P(Z„ < z). We derive the asymptotic 
relation between P(Z„ < z) and the rate of transmission R 
when polar codes with a successive cancellation decoder are 
used. From this we derive bounds on the asymptotic behavior 
of V^'~{ti,R) . We further show that the same bounds hold 
when we perform MAP decoding. The outline of the paper 
is as follows. In Section we state the main results of the 
paper. In Section |III] we first define several auxiliary processes 
and provide bounds on their asymptotic behavior. Using these 
bounds, we then prove the main results. 

II. Main Results 

Proof of the following theorems is given in Section Hill 
Theorem 3: For a BMS channel W, let Z„ = Z{Wn) be 
the Bhattacharyya process of W. 
1) For R < I{W), 



lim P(Z„ < 2' 

n— >-oo 

2) For R<1- I{W), 



)^R. 



lim 

7l~>00 



> 1 



) = R. 



Here, f{n) is any function so that f{n) = o{\/n) and 
lim„^oo f{n) = oo. The function E{n, x), < R < 1, 
is the unique integer solution of the equation 



E 

=E(n,x) 



< 2"a; < 



E 



i—E(n,x)~l 



(4) 



Discussion: Theorem [3] characterizes the asymptotic behavior 
of P(Z„ < z). By the StirUng formula applied to the 

function E{n, j^^) behaves like ^ + ^/n +o(.y/n), 

where Q{x) is the probability that the standard normal random 
variable will obtain a value larger than x. Thus by Theorem [3] 
part (1) we have 



lim P(Z„ < 2' 



Q-H 



R. 



This refines the result of Theorem |2] in the following way. 
According to Theorem IH if we transmit at rate R below 
the channel capacity, then log(— log(Pg'^(7i, i?))) scales like 
^ + o(n). Theorem 13 gives one further term by stating that 

o(n) is in fact ^/n— — ''J'" ' +o{y/n). The proof of Theorem[3] 



is based on observing that, once the process Z„ is close 
to either of the endpoints of the interval [0,1], it moves 
closer to that endpoint with high probability. As a result, the 

(i) 

quality of a channel is greatly dependent on the first 

few less significant bits of the binary expansion of i — 1. This 
observation together with the result of Theorem [3] imply the 
following. 

Corollary 4: Let be a BMS channel and let R < I{W) 
be the rate of transmission. The fraction of common indices 
chosen by polar codes and Reed-MuUer codes (normalized by 
2"i?), approaches I{W) as n — s- od. ■ 

Theorem [3] characterizes the scaling of the error probability 
of polar codes under the successive cancellation decoder The 
same result holds for the case of the MAP decoder. 

Theorem 5: Let W be a BMS channel and let R < I{W) 
be the rate of transmission. Let C{n, R) be a linear code whose 
generator matrix is obtained by choosing a subset of 2"i? rows 
of [11]^" (e-g-, polar codes or Reed-Muller codes). Denote 
by Ic the set of the indices of the chosen rows and also denote 
by P^^^{n, R), the block error probability when we use the 
code C for transmission and decode according to the MAP 
rule. We have 



wt(i)+l + log(- log(Z(W))) 



where wt(i) denotes the number of I's in the binary ex- 
pansion of i. As a result, for every such code, we have 
log(- log(P^^P(n, i?))) < f + v^SllLR) + o(V^). Also for 
the case of polar codes we have log(— log(Pp^^(n, i?))) < 

Discussion: By this theorem, for polar codes we have 



log(- log(P^AP (n,i?))) < f + 01. 



- o(^/n). Now 



since P^ (n, i?) < P^ (n, i?), for the case of polar codes 
log(- log(P^^P(n, i?))) scales as ^ + ^S-lh&ll+o{y^). 

III. Proof of the Main Result 

A. Analyzing closely related processes 

In this part we consider several auxiliary processes and 
provide bounds on their asymptotic behavior Let {iJnjrieN 
be a sequence of iid Bernoulli(i) random variables. Denote 
by (J^, il, P) the probability space generated by this sequence 
and let (J^„,S7„,P„) be the probability space generated by 
{Bi, ■ ■ ■ , Bn). Also, denote by On the natural embedding of 
Fn into F, i.e., for every F £ Fn 

Bn{F) = {(&i, 62, • • • , bn,hn+i, (&i, • • • , 6„) e F]. 

We have P„(i^) = P(6l„(F)). We now couple the process W„ 
with the sequence {Bi}: 



W+_^ ;ifB„ = l, 



n-l 



;if B„ =0. 



(5) 



As a result, Z„ ~ Z{Wn) is coupled with the sequence {Bi}. 
By using the bounds given in 13] Chapter 4] we have the 
following relationship between the Bhattacharyya parameters 
of W+, and W: 
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Z{W)^2 - Z{WY < Z{W~) < 2Z(W) - Z{Wf. 

As a result, for a BMS channel W, the process Z„ = Z{Wn) 
satisfies (||4] Lemma 3.16]) 



— Zn-l' 



;if S„ = 1, 



Zn-l , '2Zn — Zri-\ 1 ;ifS„ 



Consider two processes Z," and Z\ given by Z'q 

ZiW), 

;ifi?„ = l, 
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(6) 



and 



2Z;^_i ;ifB„=0, 



iZl,_,)^ ;ifi?„ = l, 



;if B„ = 0. 



(7) 



(8) 



Clearly, Z„ stochastically dominates Z'j and is stochasti- 
cally dominated by Z^. Also, it is easy to see that = 
{Z{W)^^^='''\ Thus 



nZn>{Z{W)f^'=' ') 
= 1. 



(9) 



The following lemma partially analyzes the behavior of Z". 
Lemma 6: For the process (defined in dTji) starting at 
= e (0, 1) we have: 



''(Z,'^ < 2-'^2S^=i 2^+* V^. (10) 

Proof: We analyze the proces^ An = — log(Z,") , i.e., 
Aq^- log(z^) = ao and 



2A„ 



;if Bn = 1, 
1 ;ifB„ = 0. 



(11) 



Note that in terms of the process An, the statement of the 
lemma can be phrased as 

2 



»(A„>/32^"=i^')>l 



Associate to each ,6r,) — i^n G iln a sequence of 

"runs" (ri, • • • , r^i^^^)). This sequence is constructed by the 
following procedure. We define ri as the smallest index i G N 
so that bi+i ^h\. In general, if X]^=i < " '^^en 



rj. = mm 



fc-i 



fe-l 



The process stops whenever the sum of the runs equals n. 
Denote the stopping time of the process by fc(aj„). In words, 
the sequence • • • , 6„) starts with hi. It then repeats h\, r\ 
times. Next follow r2 instances of h\, followed again by 
instances of fei, and so on. We see that h\ and (ri, • • • , r^^j^^j) 
fully describe lOn = • • • Therefore, there is a one- 

to-one map 

(6i, • • • , 6„) < — > {fei, (ri, • • • , (12) 

'in this paper, all the logarithms ai'e in base 2. 



Note that we can either have 6i = 1 or 6i = 0. We start with 
the first case, i.e., we first assume = 1. We have: 



j=l j odd < fe(cj„) 



and 



Analogously, for a realization (61,62, ■■) — ^ G ^ of 
the infinite sequence of random variable {SijigN, we can 
associate a sequence of runs (ri, r2, • • • ). In this regard, 
considering the infinite sequence of random variables {S jigN 
(with the extra condition Bi = 1), the corresponding sequence 
of runs, which we denote by {i?A;}fceN, is an iid sequence with 
P(i?i = — Let us now see how we can express the 
An in terms of the ri, r2, • • • , r^.)^^^). We begin by a simple 
example: Consider the sequence (61 = 1, 62, • • • , 63) and the 
associated run sequence (ri, • • • , rs) = (1, 2, 1, 3, 1). We have 

Ai=aa^\ 

A3 =ao2^i -r2, 

A4 = (ao2''^ - r2)2-^ = ao2'-i+'-^ - r22-^ 
Aj - (ao2''^ - r2)2'-^ - n - ao2'^i+'-^ - r22-^ - r^, 
As = ((ao X 2'^! - r2) x 2''^ - x 2^^ 
= ao2''i+''^+''= - r22''^+''= - r42''= 
= 2''i+''='+'''^(ao - 2-''V2 - 2~(''i+''-^V4). 

In general, for a sequence (61, • • • , 6„) with the associated run 
sequence (ri, • • • , we can write: 

A„ = ao2^' "'!''<''■('-'.)''' - ^ 7^.2^, < 3„jd '■j 

i even < k(ujn ) 
i even < k{cUn ) 

= [2^' ""1 [ao - ( ^ r.2-5:jajj< 

'i even < k{ujn ) 

= [2^"-i«-][ao-( J2 r,2-^.-<^^0]- 

i even < k{cOn ) 

Our aim is to lower-bound 

P(A„ >/?2^"=i^') 

= P„(ao- ^ r,2-^."^^<-'^^ >/?), 

i even < k{u}n) 

or, equivalently, to upper-bound 

P„( J2 r,2-^.-i^<''^^ >ao-/3). (13) 

i even < k{ujn) 

For n G N, define the set C/„ G J>j as 

C/„ = {a;„Gf^„|3?<fc(tj„): ^ n2~^' ao~ 13} . 

i even < / 

Clearly we have: 

P„( '^■'2" "^^ < ' > ao - /?) < Pn(C/n)- 

i even < k{uJn ) 
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In the following we show that if (61, • • • , 6„) G t/„, then for 
any choice of &„+i, (61, • • • , 6„, &„+i) G J/n+i- We will only 
consider the case when 6„, = 1, the other three cases can 
be verified similarly. Let a;„ = (61, • • • , fo„-i, 6„ = 1) G ?7„. 
Hence, fc(w„) is an odd number (recall that 61 = 1) and the 
quantity I]i even < fc(w„) ^^^^ ' depend on 

fk{ui„)- Now consider the sequence ojn+i = {bi,--- ,bn = 
1,1). Since the last bit (bn+i) equals 1, then r^-i^^^^^-f = 
rk{uj„) and the value of the sum remains unchanged. As a 
result (61, •••,&„, 1) G Un+i- From above, we conclude that 
di{Ui) C 9i+i{Ui+i) and as a result 

P,(C/,) = P(0.(C/,;)) < P(a,+i([/,+i)) = P.+i(f/,;+i). 

Hence, the quantity lim„_^oo Pn(tAi) = 

]P(U"=i^i(C/i)) is an upper 
bound on ( fT3] l. On the other hand, consider the set 

y = {wGt^|3/: ^ r,2"^j°«<'''^' >ao-/3}. 

?' even < I 

By the definition of V we have U^]^6'i(C/i) C V, and as a 
result, f{\J'^^e,{Ui)) < F{V). In order to bound the proba- 
bility of the set V, note that assuming Bi = 1, the sequence 
(i.e., the sequence of runs when associated with the 
sequence {Bi}i(zf^) is an iid sequence with P(i?i = j) = -jj. 
We also have 



\ao- ^ i?,,2"^^ '»ii<''^^ < /3) (14) 

i even < m 

P( R^2~ °<"i < ' >ao- 13) 

i even < m 



< 



where the last step follows from the Markov inequal- 
ity. The idea is now to provide an upper bound 



on the quantity Ep^-™ < - ^-2- - < ■ j 
Eieven<™^.2-^.-<^«^ Wehave 

OC 



Let X = 



1=1 

00 

1=1 
(=1 



E[2^]E[22 



1 



S2'(2^-F 



■E[2?' 



h 1 

<^ (E[2^])jr, 



where (a) follows from the fact that i?iS are iid and X is self- 
similar and (b) follows from Jensen inequality. As a result , an 



upper bound on the quantity E[2^] can be derived as follows. 
We have 



1 



(E[2^'])^ 



4(2* - 1) 
1 „ i , 1 



(E[2^])i4 



^ (E[2^])i 



4(25 _ 1) 



E[2-f <- 

"2(24-1) 

The equation y — — i y ^ -1 r 

2(2^-1) 4(23-1)" 4(2S-1) 

only one real valued solution y* < 2.87. As a result we have 
E[2-^] < y* < 2.87. Thus by (O we obtain 



-2/ = 



-yl 



-ys has 



P(ao 



< . < /3) < 



i even < m 

Thus, given that Bi ~ 1, we have: 



2.87 

210-/3 



> /32^^=i^O > 1 



2.87 

2^0-^ ■ 



2.87 



Or more precisely we have 

P(A„>/32^?-i^M5i = l)>l 210-/5 ■ 

Now consider the case Bi = 0. We show that a similar bound 
applies for An- Firstly note that, fixing the value of n, the 
distribution of i?i is as follows: ¥{Ri) = i- for 1 < i < n - 1 
and P(i?i = n) = We have 

P(A„ > /32S?=i ^' I Si = 0) 

n 

= ^ P(A„ > 132^^=^ I -Ri = -Bi = 0)P(iii = i I Bi = 0) 

i=l 

= ^ P(yl„ > /32S?=i I -Ri = ^ Bi = 0)P(fli = i I Bi = 0) 

n 

+ ^ P(iii = i I Bi = 0) 



E 



1 2.87 2 
+ 



2.87(ao - /3 + 1) 

2<io-/3 



2« 2"o-/3-i 2"o-/3 



2 2 



Hence, considering the two cases together, we have: 

2 



P(A„ > /32^?=i^') > 1 



2^ 



As a result of the above lemma, if the initial point of the 
process Z," is sufficiently close to zero, its behavior is close 
to the behavior of the process Z'^. The same phenomenon 
occurs for the process Z„ since it is sandwiched between 
and Z'^. The following statement relates the behavior of the 
processes Z^ and Z^. 

Corollary 7: Let be the process given in (|7|l with = 
zl G (0, 1). For X G (0, 1) we have 



(15) 



Proof: Recall E{n, x) from (|7]) and let the two events A 
and B be defined as follows. 



A = {(6i,--- ,6„)GajZ,'^(5i,--- ,6„)<2-2"'=^ 



S = {(6i,...,6„)GJ7„|2-2^--^' 
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By inserting (3 = 1 in Lemma |6] we obtain P{A) > 1 
2V2y^ and 

n 

F{B) = Bi > E{n,x)) 



As a result, 



> x-o{-^). 
\Jn 



F{Z„, <2^^ ) 



> P{AnB) 

= F{A) +P{B) -F{A(J B) 

> P{A)+F{B) - 1 



B. Proof of Theorem \3\ 

We start with the proof of part (1). The main idea behind 
the proof is to analyze the behavior of the process Z„ once its 
value is sufficiently close to the endpoints of the interval. In 
this regard, we first give a bound on the speed of converging 
to the endpoints. The proof of following lemma is given in 
the appendix. 

Lemma 8: Let be a BMS channel and Z„ = Z{Wn) be 
the corresponding Bhattacharyya process. Let p G ((^^) * , 1) 
be a fixed constant. There exist constants ai,a2 > 0, inde- 
pendent on p, such that 

(a) P(Z„ < 2p") > I{W) - aipt. 

(b) P(Z„ > 1 - 2p") > 1 - I{W) - a2p". 

■ 

We then proceed by by providing upper and lower bounds on 
the quantity 



P(Z„ < 2 



_2E(„,.)(i+e(-B«lll^))^ 



and by showing that as n grows large, both of the bounds tend 
to R. 



>R- 2^/2alp" - [2^/2I{W) + xai)p'^ - o(^). 

Thus by changing the variable n n+m, for every m,n E N 
such that n > m we have 



2-B(",x) 



(17) 



>R- 2V2aip"' - {2V2I{W) + xai)p'^ - 



y/n — m 

2) Upper bound: Consider m and x as above. By (|9]l we 
have: 



As a result. 



Zra < 2p^) = 1. 



Therefore, 



(18) 



> f{{Z{W)f"^^^''"'^'"^ > (Z(W))^'"2'''"'"' \Z^ < 2p™) 



"(Y. E{n,x)\Z^ < 2p™) 



i— m+1 
n+m 



= P( ^ Bi< E{n,x)) 



> l_2;-o( — ), 

and 

P(Z„ + „ > {Z{W)f"^^^'""'\Zm < 2/9'") 

= P(^n+m > {Z{W)f"^'''""'' I Z„ < 2p™)P(Z„ < 2^"") 
> {l-x-o{^))iI{W)~a^p'^). 

As a result, we have 

P(^n + m < (Z(M^))'"''""'"' , < 2p™) (19) 
= P(Z„ < 2p'") - P(Z„+„ > < 2^") 



]) Lower bound: Fix m G N and let x = j^. By < 1 - (1 - I{W) - a2p"') - {1 - x - o{ — )){I{W) - aip ^ ) 



Lemma [8] we have: 
As a result, 



< 2"^ 



> P(^„+™ < 2-^ ^ ■ I < 2p^)¥{Z„, < 2p^) 

> nZn+rn < 2-2^'""' | < 2p™)(/(iy) - aip^) 

> (x - 2V2p^ - o(^))(/(W^) - aip^), 

where the last inequality follows from Corollary [T] and the fact 
that assuming Zm < 2p™, the process Zn+m is dominated by 



< I{W) - (1 - x)I{W) + aap'" + aip- + o( — ) 

Vn 

= xI{W) + a2p" + aip^ + o(-^). 
Also note that 



(20) 



+ nZn+m < {Z{W)) 



, Z^ > 2p" 



We now upper bound the quantity P(Z,i+„i < 
(Z(Vr))2'"2^*""',Z,„ > 2p™). Firstly note that as m 



the process Z^ with the initial condition Zq 



2/9™. grows large we have {Z{W)) 



2"2^<"--) 



< 2^". More 



Now, since i?(n + m,x) — m < E(n, x) and a;/(M^) = R, we 
have 



^{Zn+m !i 2 ) 



precisely if we choose m large enough so that the inequality 

logp 



(16) 



2™ > TO 



logZ(W^)' 



(21) 
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is fulfilled, then the relation (Z(W^))2"'2^'" "' < 2p™ holds. 
For this choice of m we have 



Therefore, 



¥{Zn + ,n < {Z{W)y 



, Z,n > 2/9" 



= P(Z„+„ < 2p^2p^< Zm < 1 - 2p'")P(2p'" <Z„, <1 - 2p™) 

+ P(Z„+„ < 2p"' i > 1 - 2p'")P(Z,„ > 1 - 2p'") 
< P(2p'" < Z,„ < 1 - 2^"^) + ViZ.n+m < 2p™ I Z„ > 1 - 2p™). 

Now, by Lemma |8] it is easy to see that 

P(2p'" < Zm<l- 2p'") < 1 - {I{W) - a^p"^) 
- (1 - I{W) - a2pn 
= aip~ + a2p"^ ■ 

Also to upperbound P(Z„+™ < 2/9™ | > 1 - 2p"'), note 
that if we consider the process En given in dZSl l with the initial 
condition eo = 1 — 2p™, then as a result of Lemma [TO] we 
have 

nZn+.^ < 2p" I > 1 - 2p") 

< P{En < 2p™) 

< 2V2^/l - (1 - 2/9'")2 

< 8/9^. 

Summing up the above arguments, we have 

P(Z„+,„ < 2p", Z„, > 2p'") < (ai + 8)pT + a^p". (22) 

And as a result, for m large enough so that (ISTT i is fulfilled 
we have 

P(Z„+™ < (Z(M^))2'"2"<"-\ > 2p-) 
< (ai +8)/9T +a2/3". 
Plugging this into ( |20] | and using ( fT9l ). we have 

< + 2a2/3™ + (2ai + 8)p^ + o(^). 
Also, since E{n, x) < E{n + m, x) and xI{W) — i? we have: 

< + 2a2/9™ + (2ai + 8)p^ + o(^). 

Thus by changing the variable n <— n+m, for every m, n e N 
such that n > m we have 



<R + 2q2/9" + (2ai + 8)p^ + o( 



(23) 



1 



)• 



3j Combining the upper and lower bounds: Recall 
that f{n) is any function so that f{n) = o{\/n) and 
lim„_>oo ,f{n) = oo. Thus by letting m = f{n), as n grows 
large, we have m n and by using (fTTI i we have 

lim P(Z„ < 2-2 ) > R, 

n—^oo 



lim P(Z„ < 2- 



) > i?. 



Also, as lim„^oo fin) = oo, inequality (|2TI) is fulfilled as n 
grows large, and by ( |23] ), we have 



) < i?- 



And as a result. 



lim P(Z„ < 2-2 ) < R, 

n— f oo 

Therefore, since the limit of the upper and lower bound equals 
R, we get the result. 

To provepart (2), we first consider the process Z'^ = 1 — 
By using ^ we have 



= 1 - ^^+1 < 1 - zt < 2(1 - 



<{i-zlf 



71 2 



2^4 i if = 1, 

; if Bn = 0. 



Thus the process Z'^ with is stochastically dominated by 
the process given by (|7]) with Zq = Z'q. Also by using 
Lemma |8] for m S N we have 

p(z;„ < 2/9™) 

= p(i - z;„ > 1 - 2p"') 

= P(Z^ > 1 - 2/9™) 
= P(Z„ > Vl - 2p™) 

> P(^™ > 1 - p™) 

> 1 - /(M^) - 202^™. 
Similarly we obtain 

P(Z;„>l-2p™)>/(Ty)-«ip^. 

Using the above statements for the process Z'^ and going along 
the same lines as the proof of part (1), for i? < 1 — /(VF) we 
obtain 

lim P(Z; < 2-2 ) = i?, 

n— >-oo 

and by noting that = 1 — Z^ we get the result. 

C. Proof of Theorem |5] 

Let I be the set of chosen indices by the code C(n, _R), let 
l]\ the block to be transmitted (including the frozen bits), 
and let Y^" be the received vector Denote by P^fP(Af, K) 
the bit-error probability when we decode the z-th bit by the 
MAP rule. We have 

^P(A^, R) > max,ei{P^f (iV, R)} 

(b) 

> max,ei{i/([/, | Y,^ )} 

> max,gx{^(C/z I Yf,Ul-\ Uf^,)} 
= miix,ei{H{W,)}, 



where Wi is the channel seen by [/,; when we have the 
output Yi and all the other bits Ui, - ■ ■ ,Ui-i, Ui+i, ■ ■ ■ , C/2" 
available. To see step (a) consider the MAP decoder for bit 
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i. It has associated probability F^f'^{N,R) and is optimal. 
Compare this to the suboptimal bit decoder which first decodes 
the whole block and then extracts the i-th bit. The probability 
of error associated to this decoder is at most P^^^{N,R) 
since any time the block is decoded correctly also the i-th 
bit is decoded correctly. Therefore, for any i, F^f'^{N,R) < 
pMAP^'^^ i?). Step (a) follows by maximizing over i. Step (b) 
is Fano's inequality. Denote the number of Is in the binary 
expansion of i — 1 by wt(i). Then Wi is 



wt(i) times 



(24) 



As a result, Z{W^) = {Z{W))^""-" . Thus by using the 



inequality I{Wi)^ + Z{Wi)^ < 1 (HI), we have H{W^) > 
As a result, 

PrP(iV,i?) >max,ex{if(W^«)} 

,l+wl(i). 



>max,ei{-(Z(VK,))' '"'^"}- 



Since, |X| = 2"i?, the set I must contain an index i so that 
wt(i) < E{n,R). Therefore, 



'c \n,R)>^{Z{W)Y 



_2E(n,H) + l + log(- log(Z(W))) _j 



For the specific case of polar codes, we argue as follows; Let 
n G N, m = log 71. Also let < e < 1 be a constant. Using 
( fTSl l we obtain 

lim P(Z„+™ > I < 2^") 

n— ^-OD 

> 1- x + e. 



Also, using Lemma |6] we get 



lim P(Z,H 



< 2 



_2^i = "> + l " 



Zrn < 2p") = 1. 



As a result of the above two inequalities we have 



^ Zn-\-m^ 2 



\im¥{{Z{W)y 

n — >oo 

>l-x + e. 

Also, using the result of Theorem [3] part (a), it is easy to see 
that 

hm ¥{Z„+rn < 2 



I < 2p^) = X. 



As a result, given that Zm < 2p™, as n — > oo, the following 
two events have non-empty intersection 



As a result. 



hi < E{m + n,x-e) + 6(771). 

1=1 

Now since the intersection of An and i?„ is non-empty for 
large n, there exists a (61, • • • , &ri+m) G An with 

^ 6i < £'(771 + n,x-e) + 6(777). 

i=l 

And by letting e ^> and noting that X^S™ ^' ^ weight of 
some sub-channel, we get the result. 

IV. Appendix 

A. Proof of Lemma |S] 

In order to prove Lemma |8] we first need to state the 
following two lemmas and afterwards we give a proof of 
Lemma |8] 

Lemma 9: Let Zn be a process defined by Zq = zq G [0, 1] 
and 

2 



Zn 



;if Bn = 1, 



^ e [ZnV2-Zn\ 2Zn - Zj] ; if Bn = 0. 

Let Qn = Zn{\-Zn). Then 

Proof. We have 

J = Z„(l + Z„ ) ;ifB„ = l, 



Z„(l-Z„) ' Z„(1-Z„)J 



As a result 

< 



Z„y2^rzJ<a;<Z„{2-Z„) V ■^n(l — ^n) 



(a/ /!! ^\ } + Vgn(l + ^.) 



< 



, max { / + yipTiy}] 

/2-z2<a;<z(2-z),0<2<l V ' 



< 



L85 
2 

Therefore, 



A -{7 2«<"+'".-)+e(-) 



B„ = {iZ{W)f 



Zm. < 2p"'}. 



But the set An exactly represents the set of indices of the sub- 
channels needed in order to achieve rate R. Also, for every 

(^l,--- ,^m+n) e Bn WC haVC 



Lemma 10: Let En be the process defined by Eq = eo and 



En+l 



En 



;if Bn = 1, 



{Z{W)f 



< 2 



For 77 e N we have: 



(25) 



Or by applying the function log(— log()) to both sides we 
obtain 

n+m 

^ 6i < 777 + £'(77 + 777, X - e) + log(- \og{Z{W))). 

i.—7n-{-l 



P(K > 1 - 2-^'^'=' ''^ ) > 1 - 2V2^l - el- 
Proof We have: 

1 - £2+1 = l-E^,< 2(1 - £2) ;if s„ ^ 1, 



;if Bn^O. 



8 



Hence the process En = 1 — Ef-^ with the initial condition 
i?o = 1 — Gq is stochastically dominated by the process 
given by O and Zq — 1 — e^. Therefore, by ( fTOl i we have: 



>l-2V2Jl-e2 



Also, 



= > 1 - 2-^^'=' ) 

-2sr=i 



< P(-E„ > 1 - 2-^^=' *). 
As a result, we have 

HEn > 1 - 2-2^''=' ) > 1 - 2\/27l-eg. 



Using the above two lemmas, we now prove Lemma [8] Let 
pi = (j^)'^- Consider the process = — 
According to Lemma |9] and by using the Markov inequality 

nQn > Pi") = nQj > (Pi)*) < = p"- 

As a result. 



= P(Q„ <Pi") <P". 

Consider a partitioning of the interval [0,1] into the three 
intervals 



and define A, i? and C as 



S = p( Y < Zn < 



Also let A', B' and C be the fraction of A, B and C 
respectively that will eventually (as n oo) go to zero. 
Clearly we must have 

A' + B' + C' = P(Zoo = 0) = I{W). (26) 

Clearly B' < B < p" . To upper-bound C note that if we con- 
sider the process En given by dZSl l and Eq = Cq = ^+^^^'^p^ — 
then by (|6]l it is easy to see that P(i?oo = 0) is an upper bound 
on C". Thus we have 



< 2V6p? 



Therefore, 

P(Z„ < 



1- Vl-4pi^ 



= A 

> A' 

^ liW) -B' -C 

> I{W) - p" - 2V6pi^ 



As a result, since p > pi we have ^"'"^ < 2p", and we 
get 

P(Z„ < 2p") > /(VK) - (1 + 2V6)pi^. 

Thus part (a) now follows by letting ai = 1 + 2\/6. For the 
proof of part (b), let A", B" and C" be the fraction of A, B 
and C respectively that will eventually (as n — > oo) go to one. 
Clearly we must have 

A" + B" + C" = P(Zoo = 1) = 1 - I(W). (27) 

Clearly B" < S < p". To upper-bound A " note that if we 
consider the process Z„ given by Zq = ^-\/i^ipi _ ^jj^j 



Zn+l 



Zl ;ifi?. = l, 

2Z„-Z2 ;ifB, = 0, 



then F{Zoo = 1) is an upper bound on A" . Therefore we have 

can be bounded from below 

by 



\Zn > 



1 + VI - 4pi' 



= c 






> C" 






= 1 - 


I{W) 


-A" 


> 1 - 


I{W) 




> 1 - 


I{W) 





II rtll 



1 - VI - 2pi" 



And since p > pi, we get the result in a similar way as part 
(a) by taking a2 ~ 5. 
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